Toward Automatic Semantic Annotating and Pattern Mining for Domain Knowledge Acquisition
نویسندگان
چکیده
Due to the high complexity of natural language, acquisition of high quality knowledge for the purpose of fine-grained data processing still mainly relies on manual labour at present, which is extremely laborious and time consuming. In this paper, a new automatic approach using semantic annotating and pattern mining is proposed to assist engineers for domain knowledge acquisition. This approach uses Minipar to label sentences processed from domain texts. Based on the dependency relations, structural patterns are extracted and semantic bank is applied to annotate and represent concepts with semantic labels considering sentence contexts. The approach further learns and assigns relations to previously extracted concepts by pattern matching. The involved concepts and semantic labels with learned relations together, as extracted knowledge, enrich domain knowledge base. Preliminary experiments on Yahoo! Data in “heart diseases” category showed that the proposed approach is feasible for automatic domain knowledge acquisition.
منابع مشابه
Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology
Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...
متن کاملPresenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملTowards Semantic Data Mining
Incorporating domain knowledge is one of the most challenging problems in data mining. The Semantic Web technologies are promising to offer solutions to formally capture and efficiently use the domain knowledge. We call data mining technologies powered by the Semantic Web, capable of systematically incorporating domain knowledge, the semantic data mining. In this paper, we identify the importan...
متن کاملSemantic Information Usage Mining for Next Page Prediction Using Markov Model
Patterns generated by conventional Web Usage Mining methods do not provide explicit insight into the user’s underlying interest and preferences. Hence there is a need to incorporate semantic information in web usage model to understand web user’s navigational behavior at conceptual level. This motivated us to propose the semantically enriched web usage model. The proposed work integrates domain...
متن کاملAutomatic Discovery Of Term Similarities Using Pattern Mining
Term recognition and clustering are key topics in automatic knowledge acquisition and text mining. In this paper we present a novel approach to the automatic discovery of term similarities, which serves as a basis for both classification and clustering of domain-specific concepts represented by terms. The method is based on automatic extraction of significant patterns in which terms tend to app...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013